Target Predictions using LINCS Data
نویسنده
چکیده
Identifying the binding targets of small molecules is an essential process in drug discovery and development. The two conventional approaches include high throughput screening (HTS) and computational structural docking. HTS suffers from its expensive cost and time-consuming procedure, while the computational methods reply on simplifying assumptions that often leads to less accurate results. In this project, we developed machine learning based approaches to efficiently predict drug targets using the massive LINCS data. We extracted meaningful features from the LINCS data and integrated them with information from other genomic data, and build a random forest based classifier that achieves remarkable prediction accuracy. Our strategy provide an fast and efficient way of predicting drug targets, and can naturally serve as a pre-pruning step for the computationally expensive structural based approaches.
منابع مشابه
Comparing MicroRNA Target Gene Predictions Related to Alzheimer's Disease Using Online Bioinformatics Tools
Introduction: The prediction of microRNAs related to target genes using bioinformatics tools saves time and costs of the experimental analyses. In the present study, the prediction of microRNA target genes relevant to Alzheimer’s Diseases (AD) were compared with the experimentally reported data using different bioinformatics tools. Method: A total of 41 microRNAs associated with 21 essential ge...
متن کاملComparing MicroRNA Target Gene Predictions Related to Alzheimer's Disease Using Online Bioinformatics Tools
Introduction: The prediction of microRNAs related to target genes using bioinformatics tools saves time and costs of the experimental analyses. In the present study, the prediction of microRNA target genes relevant to Alzheimer’s Diseases (AD) were compared with the experimentally reported data using different bioinformatics tools. Method: A total of 41 microRNAs associated with 21 essential ge...
متن کاملCompound signature detection on LINCS L1000 big data.
The Library of Integrated Network-based Cellular Signatures (LINCS) L1000 big data provide gene expression profiles induced by over 10 000 compounds, shRNAs, and kinase inhibitors using the L1000 platform. We developed csNMF, a systematic compound signature discovery pipeline covering from raw L1000 data processing to drug screening and mechanism generation. The csNMF pipeline demonstrated bett...
متن کاملData Portal for the Library of Integrated Network-based Cellular Signatures (LINCS) program: integrated access to diverse large-scale cellular perturbation response data
The Library of Integrated Network-based Cellular Signatures (LINCS) program is a national consortium funded by the NIH to generate a diverse and extensive reference library of cell-based perturbation-response signatures, along with novel data analytics tools to improve our understanding of human diseases at the systems level. In contrast to other large-scale data generation efforts, LINCS Data ...
متن کاملBuilding Concordant Ontologies for Drug Discovery
In this study we demonstrate how we interconnect three different ontologies, the BioAssay Ontology (BAO), LINCS Information FramEwork ontology (LIFEo), and the Drug Target Ontology (DTO). The three ontologies are built and maintained for three different projects: BAO for the BioAssay Ontology Project, LIFEo for the Library of Integrated Network-Based Cellular Signatures (LINCS) project, and DTO...
متن کامل